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(57) A method 300 of aligning nodes of dependency 
structures obtained from a bilingual corpus includes, in 
one aspect, a two-phase approach wherein a first phase 
302 comprises associating nodes of the dependency 
structures to form tentative correspondences. The 
nodes of the dependency structures are then aligned as 



a function of the tentative correspondences and/or 
structural considerations in phase 304. Mappings are 
obtained from the aiigned dependency structures. The 
mappings can be expanded with varying types and 
amounts of iocal context in order that a more fiuent 
translation can be obtained when translation is per- 
formed 
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Description 

BACKGROUND OF THE INVENTION 

5 [0001] The present invention relates to autonnated language translation systems. IVIore particularly, the present in- 
vention relates to extracting transfer nnappings automatically from bilingual corpora, the mappings associating words 
and/or logical forms of a first language with words and/or logical forms of a second language. 

[0002] Machine translation systems are systems that receive a textual input in one language, translate it to a second 
language, and provide a textual output in the second language. Many machine translation systems now use a knowl- 

10 edge base having examples or mappings in order to translate from the first language to the second language. The 
mappings are obtained from training the system, which includes parsing sentences, or portions thereof, in parallel 
sentence-aligned corpora in order to extract the transfer rules or examples. These systems typically obtain a predicate- 
argument or dependency structure for source and target sentences, which are then aligned, and from the resulting 
alignment, lexical and structural translation correspondences are extracted. The transfer mappings represent these 

15 correspondences or associations. 

[0003] Translation systems that automatically extract transfer mappings (rules or examples) from bilingual corpora 
have been hampered by the difficulty of achieving accurate alignment and acquiring high quality mappings. For in- 
stance, the alignment and transfer-mapping acquisition procedure must acquire mappings with very high precision and 
be robust against errors in parsing, sentence-level alignment and in the alignment procedure itself. It can also be 

20 desirable that the acquisition procedure produce transfer mappings that provide sufficient context in order that a fluent 
translation from the first language to the second language is obtained during translation. However, as the size or 
specificity logical forms of the mappings increase, the general applicability of the trained system may decrease. 
[0004] There is thus a need to improve upon machine translation systems. Systems or methods that address one, 
several or all of the aforementioned problems would be very beneficial. 

25 

SUMMARY OF THE INVENTION 

[0005] Logical forms are first obtained from a bilingual training corpus. A logical form is a data structure (parent/child) 
that describes labeled dependencies among content words in corresponding text such as a sentence. A method of 

30 associating the logical forms to obtain transfer mappings includes associating nodes of the logical forms to form ten- 
tative lexical correspondences. The tentative correspondences are aggressively pursued in this phase such that there 
may be more than one association between some of the nodes of the logical forms. In the second phase, the nodes 
of the logical forms are aligned by eliminating competing tentative correspondences and/or as a function of the structural 
considerations of the logical forms. 

35 [0006] To eliminate competing tentative correspondences and/or analyzing the structural considerations of the logical 
forms, a set of rules can be used. The rules are ordered to create the most unambiguous alignment ("best") first and 
then use these alignments to disambiguate subsequent alignments. In other words and as a separate aspect of the 
present invention, the rules are applied to the nodes initially irrespective of the parent/child structure in order to create 
the strongest, most unambiguous alignments first. After establishment of the most meaningful alignments first, the rest 

40 of the logical forms are then aligned outwards from these points. 

[0007] The aligned logical forms are used to create mappings that can be used during run time translation of a first 
language to a second language. As another aspect of the present invention, mappings can be varied in the amount 
and type of context in order to create competing or overlapping mappings that have common elements. During run 
time translation, the competing mappings are analyzed in order to choose, for example, the largest mapping or the 

45 one with the most context, and/or the use of other criteria. The use of a larger mapping can provide a more fluent 
translation; however, by having mappings of varying context, general applicability of the system is maintained. 
[0008] Although described above with respect to logical forms, other dependency structures including parse trees, 
predicate-argument structures or other explicit or implicit structural representations of two sentences (e.g. a bracketed 
linear representation) are applicable in the present invention. 

50 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0009] 

55 FIG. 1 is a block diagram of an illustrative environment in which the present invention may be used. 

FIG. 2 is a block diagram of a machine translation architecture in accordance with one embodiment of the present 
invention. 

FIG. 3A is an example of a logical form produced for a textual input in a source language (in this example, Spanish). 
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FIG. 3B is a linked logical form for the textual input in the source language. 

FIG. 3C is a target logical form representing a translation of the source language input to a target language output 
(in this example, English). 

FIG. 4 is a flow diagram illustrating a method for aligning nodes. 
5 FIG. 5A is an example of tentative correspondences formed between logical forms. 

FIG. 5B is an example of aligned nodes formed between the logical forms of FIG. 5A. 
FIG. 6 is a flow diagram illustrating application of a set of rules to the method of FIG. 4. 
FIG. 7 is a flow diagram illustrating application of an ordered set of rules. 
FIG. 8 is a set of transfer mappings associated with the example of FIG. 5B. 

10 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS GENERAL OVERVIEW 

[0010] The following is a brief description of a general purpose computer 120 illustrated in FIG. 1 . However, the 
computer 120 is only one example of a suitable computing environment and is not intended to suggest any limitation 
^5 as to the scope of use or functionality of the invention. Neither should the computer 120 be interpreted as having any 
dependency or requirement relating to any one or combination of modules illustrated therein. 

[0011] The invention may be described in the general context of computer-executable instructions, such as program 
modules, being executed by a computer. Generally, program modules include routines, programs, objects, modules, 
data structures, etc. that perform particular tasks or implement particular abstract data types. The invention may also 

20 be practiced in distributed computing environments where tasks are performed by remote processing devices that are 
linked through a communications network. In a distributed computing environment, program modules may be located 
in both local and remote computer storage media including memory storage devices. Tasks performed by the programs 
and modules are described below and with the aid of figures. Those skilled in the art can implement the description 
and figures as processor executable instructions, which can be written on any form of a computer readable media. 

25 [0012] With reference to FIG. 1, modules of computer 120 may include, but are not limited to, a processing unit 140, 
a system memory 1 50, and a system bus 1 41 that couples various system modules orcomponents including the system 
memory to the processing unit 140. The system bus 141 may be any of several types of bus structures including a 
memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures. By way 
of example, and not limitation, such architectures include Industry Standard Architecture (ISA) bus. Universal Serial 

30 Bus (USB), MicroChannel Architecture (MCA) bus. Enhanced ISA (EISA) bus, Video Electronics Standards Association 
(VESA) local bus, and Peripheral Module Interconnect (PCI) bus also known as Mezzanine bus. Computer 1 20 typically 
includes a variety of computer readable mediums. Computer readable mediums can be any available media that can 
be accessed by computer 1 20 and includes both volatile and nonvolatile media, removable and non-removable media. 
By way of example, and not limitation, computer readable mediums may comprise computer storage media and com- 

35 munication media. Computer storage media includes both volatile and nonvolatile, removable and non-removable 
media implemented in any method or technology for storage of information such as computer readable instructions, 
data structures, program modules/components or other data. Computer storage media includes, but is not limited to, 
RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other 
optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or 

40 any other medium which can be used to store the desired information and which can be accessed by computer 120. 
[0013] Communication media typically embodies computer readable instructions, data structures, program modules 
or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any infor- 
mation delivery media. The term "modulated data signal" means a signal that has one or more of its characteristics 
set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, com- 

45 munication media includes wired media such as a wired network or direct-wired connection, and wireless media such 
as acoustic, FR, infrared and other wireless media. Combinations of any of the above should also be included within 
the scope of computer readable media. 

[0014] The system memory 150 includes computer storage media in the form of volatile and/or nonvolatile memory 
such as read only memory (ROM) 151 and random access memory (RAM) 152. A basic input/output system 153 
50 (BIOS), containing the basic routines that help to transfer information between elements within computer 120, such as 
during start-up, is typically stored in ROM 151. RAM 152 typically contains data and/or program modules that are 
immediately accessible to and/or presently being operated on by processing unit 140. By way of example, and not 
limitation, FIG. 1 illustrates operating system 154, application programs 155, other program modules 156, and program 
data 157. 

55 [0015] The computer 120 may also include other removable/non-removable volatile/nonvolatile computer storage 
media. By way of example only, FIG. 1 illustrates a hard disk drive 161 that reads from or writes to non-removable, 
nonvolatile magnetic media, a magnetic disk drive 1 71 that reads from or writes to a removable, nonvolatile magnetic 
disk 172, and an optical disk drive 175 that reads from or writes to a removable, nonvolatile optical disk 176 such as 
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a CD ROM or other optical media. Other rem ovable/non -removable, volatile/nonvolatile computer storage media that 
can be used in the exemplary operating environment include, but are not limited to, magnetic tape cassettes, flash 
memory cards, digital versatile disks, digital video tape, solid state RAM, solid state ROM, and the lil<e. The hard disk 
drive 161 is typically connected to the system bus 141 through a non-removable memory interface such as interface 
5 160, and magnetic disk drive 1 71 and optical disk drive 1 75 are typically connected to the system bus 141 by a remov- 
able memory interface, such as interface 170. 

[0016] The drives and their associated computer storage media discussed above and illustrated in FIG. 1 , provide 
storage of computer readable instructions, data structures, program modules and other data for the computer 1 20. In 
FIG. 1 , for example, hard disk drive 1 61 is illustrated as storing operating system 1 64, application programs 1 65, other 
10 program modules 1 66, and program data 1 67. Note that these modules can either be the same as or different from 
operating system 154, application programs 155, other program modules 1 56, and program data 157. Operating system 
164, application programs 165, other program modules 166, and program data 167 are given different numbers here 
to illustrate that, at a minimum, they are different copies. 

[0017] A user may enter commands and information into the computer 120 through input devices such as a keyboard 
15 1 82, a microphone 1 83, and a pointing device 1 81 , such as a mouse, trackball or touch pad. Other input devices (not 
shown) may include a joystick, game pad, satellite dish, scanner, or the like. These and other input devices are often 
connected to the processing unit 1 40 through a user input interface 1 80 that is coupled to the system bus, but may be 
connected by other interface and bus structures, such as a parallel port, game port or a universal serial bus (USB). A 
monitor 184 or other type of display device is also connected to the system bus 141 via an interface, such as a video 
20 interface 1 85. In addition to the monitor, computers may also include other peripheral output devices such as speakers 
187 and printer 186, which may be connected through an output peripheral interface 188. 

[001 8] The computer 1 20 may operate in a networked environment using logical connections to one or more remote 
computers, such as a remote computer 194. The remote computer 194 may be a personal computer, a hand-held 
device, a server, a router, a network PC, a peer device or other common network node, and typically includes many 

25 or all of the elements described above relative to the computer 120. The logical connections depicted in FIG. 1 include 
a local area network (LAN) 191 and a wide area network (WAN) 193, but may also include other networks. Such 
networking environments are commonplace in offices, enterprise-wide computer networks, intranets and the Internet. 
[0019] When used in a LAN networking environment, the computer 120 is connected to the LAN 191 through a 
network interface or adapter 190. When used in a WAN networking environment, the computer 120 typically includes 

30 a modem 1 92 or other means for establishing communications over the WAN 1 93, such as the Internet. The modem 
192, which may be internal or external, may be connected to the system bus 141 via the user input interface 1 80, or 
other appropriate mechanism. In a networked environment, program modules depicted relative to the computer 120, 
or portions thereof, may be stored in the remote memory storage device. By way of example, and not limitation, FIG. 
1 illustrates remote application programs 195 as residing on remote computer 194. It will be appreciated that the 

35 network connections shown are exemplary and other means of establishing a communications link between the com- 
puters may be used. 

[0020] The invention is also operational with numerous other general purpose orspecial purpose computing systems, 
environments or configurations. Examples of well known computing systems, environments, and/or configurations that 
may be suitable for use with the invention include, but are not limited to, regular telephones (without any screen) 
40 personal computers, server computers, hand-held or laptop devices, multiprocessor systems, microprocessor-based 
systems, set top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, 
distributed computing environments that include any of the above systems or devices, and the like. 

OVERVIEW OF MACHINE TRANSLATION SYSTEM 

45 

[0021] Prior to discussing the present invention in greater detail, a brief discussion of a logical form may be helpful. 
A full and detailed discussion of logical forms and systems and methods for generating them can be found in U.S. 
Patent No. 5,966,686 to Heidorn et al., issued October 12, 1 999 and entitled METHOD AND SYSTEM FOR COMPUT- 
ING SEMANTIC LOGICAL FORMS FROM SYNTAX TREES. Briefly, however, logical forms are generated by perform- 

50 ing a morphological analysis on an input text to produce conventional phrase structure analyses augmented with gram- 
matical relations. Syntactic analyses undergo further processing in order to derive logical forms, which are data struc- 
tures that describe labeled dependencies among content words in the textual input. Logical forms can normalize certain 
syntactical alternations, (e.g., active/passive) and resolve both intrasentential anaphora and long distance dependen- 
cies. As illustrated herein, for example in FIG. 3A, a logical form 252 can be represented as a graph, which helps 

55 intuitively in understanding the elements of logical forms. However, as appreciated by those skilled in the art, when 
stored on a computer readable medium, the logical forms may not readily be understood as representing a graph. 
[0022] Specifically, a logical relation consists of two words joined by a directional relation type, such as: LogicalSub- 
ject, LogicalObject, IndirectObject; 
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Logical Nominative, LogicalComplement, LogicalAgent; 
CoAgent, Beneficiary; 
l\/lodifier, Attribute, SentenceModifier; 
Prepositional Relationsliip; 
5 Synonym, Equivalence, Apposition; 
Hypernym, Classifier, Subclass; 
Means, Purpose; 

Operator, Modal, Aspect, DegreeModifier, Intensifier; 

Focus, Topic; 
10 Duration, Time; 

Location, Property, Material, Manner, Measure, Color, Size; 

Characteristic, Part; 

Coordinate; 

User, Possessor; 
15 Source, Goal, Cause, Result; and 

Domain. 

[0023] A logical form is a data structure of connected logical relations representing a single textual input, such as a 
sentence or part thereof. The logical form minimally consists of one logical relation and portrays structural relationships 
(i.e., syntactic and semantic relationships), particularly argument and/or adjunct relation(s) between important words 
20 in an input stnng. 

[0024] In one illustrative embodiment, the particular code that builds logical forms from syntactic analyses is shared 
across the various source and target languages that the machine translation system operates on. The shared archi- 
tecture greatly simplifies the task of aligning logical form segments from different languages since superficially distinct 
constructions in two languages frequently collapse onto similar or identical logical form representations. Examples of 

25 logical forms in different languages are described in greater detail below with respect to FIGS. 3A-3C. 

[0025] FIG. 2 is a block diagram of an architecture of a machine translation system 200 in accordance with one 
embodiment of the present invention. System 200 includes parsing components 204 and 206, statistical word associ- 
ation learning component 208, logical form alignment component 210, lexical knowledge base building component 
212, bilingual dictionary 21 4, dictionary merging component 21 6, transfer mapping database 21 8 and updated bilingual 

30 dictionary 220. During training and translation run time, the system 200 utilizes analysis component 222, matching 
component 224, transfer component 226 and/or generation component 228. 

[0026] In one illustrative embodiment, a bilingual corpus is used to train the system. The bilingual corpus includes 
aligned translated sentences (e.g., sentences in a source or target language, such as English, in 1 -to-1 correspondence 
with their human-created translations in the other of the source or target language, such as Spanish). During training, 
35 sentences are provided from the aligned bilingual corpus into system 200 as source sentences 230 (the sentences to 
be translated), and as target sentences 232 (the translation of the source sentences). Parsing components 204 and 
206 parse the sentences from the aligned bilingual corpus to produce source logical forms 234 and target logical forms 
236. 

[0027] During parsing, the words in the sentences are converted to normalized word forms (lemmas) and can be 
40 provided to statistical word association learning component 208. Both single word and multi-word associations are 
iteratively hypothesized and scored by learning component 208 until a reliable set of each is obtained. Statistical word 
association learning component 208 outputs learned single word translation pairs 238 as well as multi-word pairs 240. 
[0028] The multi-word pairs 240 are provided to a dictionary merge component 216, which is used to add additional 
entnes into bilingual dictionary 214 to form updated bilingual dictionary 220. The new entries are representative of the 
45 multi-word pairs 240. 

[0029] The single and multi-word pairs 238, along with source logical forms 234 and target logical forms 236 are 
provided to logical form alignment component 21 0. Briefly, component 21 0 first establishes tentative correspondences 
between nodes in the source and target logical forms 230 and 236, respectively. This is done using translation pairs 
fromabilingual lexicon (e.g. bilingual dictionary) 214, which can be augmented with the single and multi-word translation 

50 pairs 238, 240 from statistical word association learning component 208. After establishing possible correspondences, 
alignment component 21 0 aligns logical form nodes according to both lexical and structural considerations and creates 
word and/or logical form transfer mappings 242. This aspect will be explained in greater detail below. 
[0030] Basically, alignment component 210 draws links between logical forms using the bilingual dictionary informa- 
tion 214 and single and multi-word pairs 238, 240. The transfer mappings are optionally filtered based on a frequency 

55 with which they are found in the source and target logical forms 234 and 236 and are provided to a lexical knowledge 
base building component 212. 

[0031] While filtering is optional, in one example, if the transfer mapping is not seen at least twice in the training data, 
it is not used to build transfer mapping database 218, although any other desired frequency can be used as a filter as 
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well. It should also be noted that other filtering techniques can be used as well, other than frequency of appearance. 
For example, transfer nnappings can be filtered based upon whether they are formed from complete parses of the input 
sentences and based upon whether the logical forms used to create the transfer mappings are completely aligned. 
[0032] Component 212 builds transfer mapping database 218, which contains transfer mappings that basically link 
5 words and/or logical forms in one language, to words and/or logical forms in the second language. With transfer mapping 
database 21 8 thus created, system 200 is now configured for runtime translations. 

[0033] During translation run time, a source sentence 250, to be translated, is provided to analysis component 222. 
Analysis component 222 receives source sentence 250 and creates a source logical form 252 based upon the source 
sentence input. An example maybe helpful. In the present example, source sentence 250 is a Spanish sentence "Haga 
10 click en el boton de opcion" which is translated into English as "Click the option button" or, literally, "Make click in the 
button of option". 

[0034] FIG. 3A illustrates the source logical form 252 generated for source sentence 250 by analysis component 
222. The source logical form 252 is provided to matching component 224. Matching component 224 attempts to match 
the source logical form 252 to logical forms in the transfer mapping database 218 in order to obtain a linked logical 

15 form 254. Multipletransfermappings may match portions of source logical form 252. Matching component224 searches 
forthebest set of matching transfer mappings in database 21 8 that have matching lemmas, parts of speech, and other 
feature infonnation. The set of best matches is found based on a predetermined metric. For example, transfer mappings 
having larger (more specific) logical forms may illustratively be preferred to transfer mappings having smaller (more 
general) logical forms. Among mappings having logical forms of equal size, matching component 224 may illustratively 

20 prefer higher frequency mappings. Mappings may also match overlapping portions of the source logical form 252 
provided that they do not conflict with each other in any way. A set of mappings collectively may be illustratively preferred 
if they cover more of the input sentence than alternative sets. Other metrics used in matching the input logical form to 
those found in database 21 8 are discussed in greater detail below with respect to Table 1 . 

[0035] After a set of matching transfer mappings is found, matching component 224 creates links on nodes in the 
25 source logical form 252 to copies of the corresponding target words or logical form segments received by the transfer 
mappings, to generate linked logical form 254. FIG. 3B illustrates an example of linked logical form 254 for the present 
example. Links for multi-word mappings are represented by linking the root nodes (e.g., Hacer and Click) of the cor- 
responding segments, then linking an asterisk to the other source nodes participating in the multi-word mapping (e.g., 
Usted and Clic). Sublinks between corresponding individual source and target nodes of such a mapping (not shown 
30 in FIG. 3B) may also illustratively be created for use during transfer. 

[0036] Transfer component 226 receives linked logical form 254 from matching component 224 and creates a target 
logical form 256 that will form the basis of the target translation. This is done by performing a top down traversal of the 
linked logical form 254 in which the target logical form segments pointed to by links on the source logical form 252 
nodes are combined. When combining together logical form segments for possibly complex multi-word mappings, the 
35 sublinks set by matching component 224 between individual nodes are used to determine correct attachment points 
for modifiers, etc. Default attachment points are used if needed. 

[0037] In cases where no applicable transfer mappings are found, the nodes in source logical form 252 and their 
relations are simply copied into the target logical form 256. Default single word translations may still be found in transfer 
mapping database 21 8 for these nodes and inserted in target logical form 256. However, if none are found, translations 
40 can illustratively be obtained from updated bilingual dictionary 220, which was used during alignment. 

[0038] FIG. 3C illustrates a target logical form 256 for the present example. It can be seen that the logical form 
segments from "click" to "button" and from "button" to "option" were stitched together from linked logical form 254 to 
obtain target logical form 256. 

[0039] Generation component 228 is illustratively a rule-based, application-independent generation component that 
45 maps from target logical form 256 to the target string (or output target sentence) 258. Generation component 228 may 
illustratively have no information regarding the source language of the input logical forms, and works exclusively with 
information passed to it by transfer component 226. Generation component 228 also illustratively uses this information 
in conjunction with a monolingual (e.g., for the target language) dictionary to produce target sentence 258. One generic 
generation component 228 is thus sufficient for each language. 
50 [0040] It can thus be seen that the present system parses information from various languages into a shared, common, 
logical form so that logical forms can be matched among different languages. The system can also utilize simple filtering 
techniques in building the transfer mapping database to handle noisy data input. Therefore, the present system can 
be automatically trained using a very large number of sentence pairs. In one illustrative embodiment, the number of 
sentence pairs is in excess of 1 0,000. In another illustrative embodiment, the number of sentence pairs is greater than 
55 50,000 to 100,000, and may be in excess of 180,000, 200,000, 350,000 or even in excess if 500,000 or 600,000 
sentence pairs. Also, the number of sentence pairs can vary for different languages, and need not be limited to these 
numbers. 
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LOGICAL FORM ALIGNMENT 

[0041] Fig. 4 illustrates a method 300 of associating logical forms of at least sentence fragments from two different 
languages, wherein the logical forms comprise nodes organized in a parent/child structure. Method 300 includes as- 
5 sociating nodes of the logical forms to form tentative correspondences as indicated at block 302 and aligning nodes 
of the logical forms by eliminating at least one of the tentative correspondences and/or structural considerations as 
indicated at block 304. 

[0042] As indicated above with respect to Fig. 2, alignment component 21 0 accesses bilingual dictionary 21 4 In order 
to form tentative correspondences, typically lexical correspondences, between the logical forms. Bilingual dictionary 
10 21 4 can be created by merging data from multiple sources, andean also use inverted target-to-source dictionary entries 
to improve coverage. As used herein, bilingual dictionary 214 also represents any other type of resource that can 
provide correspondences between words. Bilingual dictionary 21 4 can also be augmented with translation correspond- 
ences acquired using statistical techniques. 

[0043] In Fig. 2, the statistical techniques are performed by component 208. Although the output from component 
15 208 can be used by alignment component 21 0, it is not necessary for operation of alignment component 21 0. However, 
one embodiment of component 208 will be described here, briefly, for the sake of completeness. 
[0044] Component 208 receives a parallel, bilingual training corpus that is parsed into its content words. Word as- 
sociation scores for each pair of content words consisting of a word of language LI that occurs in a sentence aligned 
in the bilingual corpus to a sentence of language L2 in which the other word occurs. A pair of words is considered 
20 "linked" In a pair of aligned sentences if one of the words is the most highly associated, of all the words in its sentence, 
with the other word. The occurrence of compounds Is hypothesized in the training data by identifying maximal, con- 
nected sets of linked words In each pair of aligned sentences in the processed and scored training data. Whenever 
one of these maximal, connected sets contains more than one word in either or both of the languages, the subset of 
the words in that language is hypothesized as a compound. The original input text is rewritten, replacing the hypoth- 
25 esized compounds by single, fused tokens. The association scores are then recomputed for the compounds (which 
have been replaced by fused tokens) and any remaining individual words in the input text. The association scores are 
again recomputed, except that this time, co-occurrences are taken into account in computing the association scores 
only where there is no equally strong or stronger other association In a particular pair of aligned sentences in the 
training corpus. 

30 [0045] Translation pairs can be identified as those word pairs or token pairs that have association scores above a 
threshold, after the final computation of association scores. 

[0046] Similarly, component 208 can also assist in identifying translations of "captoids", by which we mean titles, or 
other special phrases, all of whose words are capitalized. (Finding translations of captoids presents a special problem 
in languages like French or Spanish, in which convention dictates that only the first word of such an Item is capitalized, 

35 so that the extent of the captold translation Is difficult to determine.) In that embodiment, compounds are first identified 
in a source language (such as English). This can be done by finding strings of text where the first word begins with a 
capital letter, and later tokens in the contiguous string do not begin with a lowercase letter. Next, compounds are 
hypothesized in the target text by finding words that start with a capital letter and flagging this as the possible start of 
a corresponding compound. The target text is then scanned from left to right flagging subsequent words that are most 

40 strongly related to words in the identified compound in the source text, while allowing up to a predetermined number 
(e.g., 2) contiguous non-most highly related words, so long as they are followed by a most highly related word. 
[0047] The left to right scan can be continued until more than the predetermined number (e.g., more than 2) contig- 
uous words are found that are not most highly related to words In the Identified compound in the source text, or until 
no more most highly related words are present in the target text, or until punctuation is reached. 

45 [0048] While the above description has been provided for component 208, it is to be noted that component 208 is 
optional. 

[0049] Referring again to method 300 In FIG. 4, generally, forming tentative correspondences in step 302 is aggres- 
sively pursued with the purpose of attempting to maximize the number of tentative correspondences formed between 
the logical forms. Accuracy of the tentative correspondences is not the most important criteria in step 302 because 

50 step 304 will further analyze the tentative correspondences and remove those that are determined to be incorrect. 
[0050] Bilingual dictionary 214 represents direct translations used for forming tentative correspondences. However, 
in order to form additional tentative correspondences, derivational morphology can also be used. For example, trans- 
lations of morphological bases and derivations, and base and derived forms of translations, can also be used to form 
tentative correspondences in step 302. Likewise, tentative correspondences can also be formed between nodes of the 

55 logical forms wherein one of the nodes comprises more lexical elements or words than the other node. For instance, 
as is common, one of the nodes can comprise a single word in one of the languages, while the other node comprises 
at least two words in the other language. Closely related languages such as English, Spanish, etc. also have word 
similarity (cognates) that can be used with fuzzy logic to ascertain associations. These associations can then be used 
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to form tentative correspondences. 

[0051] At this point, it may be helpful to consider an example of logicai forms to be aligned. Referring to Fig. 5A, 
logical form 320 was generated for the sentence "En Informacion del hipervinculo, haga die en la direccion del hiper- 
vinculo", while iogical form 322 was generated for the English transiation as "Under Hyperlink Information, clicl<the 
5 Hyperlink address." 

[0052] Fig. 5A further iliustrates each of the tentative correspondences 323 identified in step 302. As an example of 
the aggressive pursuit of tentative correspondences in step 302, in this example, each of the occurrences of "Hiperv- 
inculo" includes two different tentative correspondences with "Hyperlink_lnformation" and "hyperlink" in the English 
logical form 322. 

10 [0053] Referring now to step 304, the iogical forms are aligned, which can include eliminating one or more of the 
tentative correspondences formed in step 302, and/or which can be done as a function of structurai considerations of 
the logical forms. In one embodiment, step 304 includes aiigning nodes of the logicai forms as a function of a set of 
ruies. In a further embodiment, each of the rules of the set of rules is applied to the logical forms in a selected order. 
In particular, the rules are ordered to create the most unambiguous alignments ("best alignments") first, and then, if 

^5 necessary, to disambiguate subsequent node alignments. It is important to note that the order that the rules are applied 
in is not based upon the structure of the logical forms, i.e., top-down processing or bottom-up processing but rather, 
to begin with the most linguisticaliy meaningful alignments, wherever they appear in the logical form. As such, this set 
of rules can be considered to be applied to the nodes of each of the logical forms non-lineariy as opposed to linearly 
based upon the structure of the logical forms. Generally, the rules are intended to be language-neutral in order that 

20 they can be universaliy applied to any language. 

[0054] Fig. 6 generaily illustrates application of the set of rules to the iogical forms as method 328. At step 330, each 
of the nodes of the iogical forms is considered to be "unaiigned" as opposed to "aligned". The set of rules is applied 
to the unaligned nodes irrespective of structure at step 332 to form aligned nodes. Therefore, it is desirable to distinguish 
between unaligned nodes and aligned nodes. One technique includes assigning all of the nodes initiaily to the set of 

25 unaiigned nodes, and removing nodes when they are aligned. The use of sets whether actively formed in different 
locations of a computer readable medium or virtualiy formed through the use of Booiean tags associated with the nodes 
merely provides a convenient way in which to identify unaligned nodes and aligned nodes. 

[0055] At step 332, the set of rules is applied to each of the unaligned nodes. FIG. 7 schematicaliy illustrates aspects 
of step 332 that can be impiemented to apply the set of rules. In one embodiment as discussed above, the rules are 
30 appiied in a specified order. Herein "N" is a counter that is used to indicate which of the rules is applied. In the first 
iteration, step 334 applies the first rule to each of the unaligned nodes. If a rule faiis to be applicabie to any of the 
unaiigned nodes, another ruiefrom the set (and in one embodiment, the next successive rule indicative of a linguistically 
meaningful alignment) is then applied as indicated at steps 336 and 338. 

[0056] If all the rules of the set of rules have been appiied to ail the nodes at step 340, the aiignment procedure is 
35 finished. It should be noted that under some situations, not ali of the nodes will be aligned. 

[0057] If a rule can be applied to a set of nodes of the logicai forms, the nodes are identified as being aligned and 
removed from the set of unaligned nodes, and application of the rules continues. However, in one embodiment, it is 
advantageous to begin again with the rules once some rules have been applied to obtain a more iinguistically meaningfui 
alignment. Therefore, it can be desirable to again apply rules that have previously been applied. In this manner, in one 
40 embodiment, each of the ruies of the set of rules is applied again starting with, for example, the first rule as indicated 
at step 342. 

[0058] The following is an exemplary set of rules for aligning nodes of the iogical forms. The set of nodes presented 
herein is ordered based on the strongest to weakest linguisticaily meaningfui alignments of the nodes. As appreciated 
by those skiiled in the art, reordering of at ieast some of the rules presented herein may not significantly aiter the quality 
45 of aiignments of the iogical forms. 

1 . If a bi-directionally unique translation exists between a node or set of nodes in one logical form and a node or 
set of nodes in the other logical form, the two nodes or sets of nodes are aiigned to each other. A bi-directionally 
unique translation exists if a node or a set of nodes of one logicai form has a tentative correspondence with a node 

50 or a set of nodes in the other iogical form, such that every node in the first set of nodes has a tentative correspond- 

ence with every node in the second set of nodes, and no other correspondences, and every node in the second 
set of nodes has a tentative correspondence with every node in the first set of nodes, and no other correspond- 
ences. 

2. A pair of parent nodes, one from each logical form, having a tentative correspondence to each other, are aligned 
55 with each other if each child node of each respective parent node is already aligned to a chiid of the other parent 

node. 

3. A pair of child nodes, one from each logical form, are aligned with each other if a tentative correspondence 
exists between them and if a parent node of each respective child node is aiready aligned to a corresponding 
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parent node of the other child. 

4. A pair of nodes, one from each logical form, are aligned to each other if respective parent nodes of the nodes 
under consideration are aligned with each other and respective child nodes are also aligned with each other. 

5. A node that is a verb and an associated child node that is not a verb from one logical form are aligned to a 
5 second node that is a verb of the other logical form if the associated child node is already aligned with the second 

verb node, and either the second verb node has no aligned parent nodes, or the first verb node and the second 
verb node have child nodes aligned with each other. 

6. A pair of nodes, one from each logical form, comprising the same part- of -speech, are aligned to each other, if 
there are no unaligned sibling nodes, and respective parent nodes are aligned, and linguistic relationships between 

10 the set of nodes under consideration and their respective parent nodes are the same. 

7. A pair of nodes, one from each logical form, comprising the same part-of-speech, are aligned to each other if 
respective child nodes are aligned with each other and the linguistic relationship between the set of nodes under 
consideration and their respective child nodes are the same. 

8. If an unaligned node of one of the logical forms having immediate neighbor nodes comprising respective parent 
15 nodes, if any, all aligned, and respective child nodes, if any, all aligned, and if exactly one of the immediate nodes 

is a non-compound word aligned to a node of the other logical form comprising a compound word, then align the 
unaligned node with the node comprising the compound word. Note that the immediate neighbor nodes herein 
comprise adjacent parent and child nodes however the existence of parent and child nodes is not required, but if 
they are present, they must be aligned. 
20 9, A pair of nodes, one from each logical form, comprising pronouns, are aligned to each other if respective parent 

nodes are aligned with each other and neither of the nodes under consideration have unaligned siblings. 

10. A pair of nodes, one from each logical form, comprising nouns are aligned to each other if respective parent 
nodes comprising nouns are aligned with each other and neither of the nodes under consideration have unaligned 
sibling nodes, and wherein a linguistic relationship between each of the nodes under consideration and their re- 

25 spective parent nodes comprises either a modifier relationship or a prepositional relationship. 

1 1 . A first verb node of one logical form is aligned to a second verb node of the other logical form if the first verb 
node has no tentative correspondences and has a single associated child verb node that is already aligned with 
the second verb node. 

12. A first verb node and a single, respective parent node of one logical form is aligned to a second verb node of 
30 the other logical form if the first verb node has no tentative correspondences and has a single parent verb node 

that is already aligned with the second verb node, where the single parent verb node has no unaligned verb child 
nodes besides the first verb node, and the second verb node has no unaligned verb child nodes. 

1 3. A first node comprising a pronoun of one logical form is aligned to a second node of the other logical form if a 
parent node of the first node is aligned with the second node and the second node has no unaligned child nodes. 

35 1 4. A first verb node and a respective parent verb of one logical form is aligned to a second verb node of the other 

logical form if the first verb node has no tentative correspondences and the parent verb node is aligned with the 
second verb node and where the relationship between the first verb and the parent verb node comprise a modal 
relationship. 

40 [0059] Some general classifications of the rules provided above include that one rule (rule 1) is primarily based upon 
the correspondences established in step 302, and in the embodiment illustrated, it is considered to be the strongest 
meaningful alignment since no ambiguity is present. Other rules such as rules 2, 3, 11, 12 and 14 are based on a 
combination of, or a lack of, tentative correspondences and the structure of the nodes under consideration and previ- 
ously aligned nodes. The remaining rules rely solely on relationships between nodes under consideration and previ- 

45 ously aligned nodes. Other general classifications that can be drawn include that the rules pertain to verbs, nouns and 
pronouns. 

[0060] Referring back to the logical forms and tentative correspondences of Fig. 5A, the rules set out above can be 
applied according to the method 300 of Fig. 4 in order to align the nodes as illustrated in Fig. 5B. In this example, the 
two instances of "Hipervinculo" have two ambiguous tentative correspondences, and while the correspondence from 
50 "Informacion" to "Hyperlink_lnformation" is unique, the reverse is not. It should also be noted that neither the mono- 
lingual nor the bilingual lexicons or dictionaries have been customized for this domain. For example, there is no entry 
in the lexicon for "Hyperlinkjnformation". This unit has been assembled by general rules that link sequences of cap- 
italized words. Tentative lexical correspondences established for this element are based on translations found for its 
individual components. 

55 [0061] Applying the alignment rules as described above, the alignment mappings created by the rules are illustrated 
in Fig. 5B as dotted lines 344, and are obtained as follows. 

[0062] Iterating through the rules again, rule 1 applies in three places, creating alignment mappings between "direc- 
cion" and "address", "usted" and "you", and "die" and "click". These are the initial "best" alignments that provide the 
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anchors from which the method will work outwards to align the rest of the structure. 

[0063] Rule 2 does not apply to any nodes, but Rule 3 applies next to align the instance of "hipervinculo", that is the 
child of "direccion" to "hyperlink", which is the child of "address". The alignment method thus leveraged a previously 
created alignment ("direccion" to "address") and the structure of the logical form to resolve the ambiguity present at 
5 the lexical level. 

[0064] Rule 1 applies (where previously it did not) to create a many-to-one mapping between "Informacion" and 
"hipervinculo" to "Hyperlink_lnformation". The uniqueness condition in this rule is now met because the ambiguous 
alternative was cleared away by the prior application of Rule 3. 

[0065] Rule 4 does not apply, but rule 5 applies to rollup "hacer" with its object "die", since the latter is already aligned 
10 to a verb. This produces the many-to-one alignment of "hacer" and "die" to "dick" 

[0066] Referring back to Fig. 7, alignment of the logical forms is completed when the rules are no longer applicable 
to any of the nodes. At this point, transfer mappings can be obtained by component 212. 

[0067] Fig. 8 illustrates some of the transfer mappings obtainable from the example of aligned logical forms in Fig. 
5B (other than transfer mapping 353 which is included as an example of a conflicting transfer mapping discussed in 

15 the next section). Generally, a transfer mapping or simply "mapping" is indicative of associating a word or logical form 
of a first language with a corresponding word or logical form of a second language. The mappings can be stored on 
any computer readable medium as explicit pointers linking the words or logical forms of the first language with the 
corresponding words or logical forms of the second language. Likewise, the mappings can be stored with the words 
or logical forms rather than in a separate database. As appreciated by those skilled in the art, other techniques can 

20 be used to associate words or logical forms of the first language with words or logical forms of the second language, 
and it is this association, that constitutes the mappings regardless of the specific techniques used in order to record 
this information. 

[0068] Each mapping created during the alignment procedure can be a base structure upon which further mappings 
with additional context are also created. In particular, and as another aspect of the present invention, information can 

25 be stored on a computer readable medium to translate text from a first language to a second language, where the 
information comprises a plurality of mappings. Each mapping is indicative of associating a word or logical form of the 
first language with a word or logical form of the second language. However, in addition, at least some of the mappings 
corresponding to logical forms of the first language have varying context with some common elements. Likewise, at 
least some of the logical forms of the second language corresponding to the logical forms of the first language may 

30 also have varying context with some common elements. In other words, at least some of the core mappings obtained 
from the alignment procedure are used to create other, competing mappings having varying types and amounts of 
local context. 

[0069] Referring to Fig. 8, mappings 350, 352, and 354 illustrate how an element of a logical form can vary. Mapping 
350 comprises the base or core mapping on which further mappings are created. Mapping 352 expands the core 

35 mapping 350 to include an additional linguistic element, herein the direct object of the word "click", while the mapping 
354 is expanded from the core mapping 350 such that the additional element comprises an under-specified node ("*") 
indicating a part of speech but no specific lemma. By comparing the mappings 350, 352 and 354, as well as mappings 
356 and 358, it can be seen that the logical forms of the first language have common elements (parts of speech and/ 
or lemmas), while the logical forms of the second language also have common elements. 

40 [0070] By storing mappings indicative of logical forms with overlapping context, during translation run time, fluency 
and general applicability of the mappings for translating between the languages is maintained. In particular, by having 
mappings associating both words and smaller logical forms of the languages, translation from the first language to the 
second language is possible if the text to be translated was not seen in the training data. However, to the extent that 
the larger context was present in the training data, this is also reflected in the mappings such that when a mapping of 

45 larger context is applicable, a more fluent translation between the first language and the second language can be 
obtained. 

[0071] Generally, linguistic constructs are used to provide boundaries for expanding the core mappings to include 
additional context. For example, a mapping for an adjective can be expanded to include the noun it modifies. Likewise, 
a mapping for a verb can be expanded to include the object as context. In another example, mappings for noun collo- 
50 cations are provided individually as well as a whole. As further illustrated in Fig. 8, some of the mappings can include 
under-specified nodes ("*"), wherein the part of speech is indicated but no specific lemma is provided. These types of 
mappings increase the overall applicability of the mappings for translating from the first language to the second lan- 
guage, but also include context to enhance fluency of the translation obtained. 

[0072] In general, mappings that can be created may have any number of wild-card or underspecified nodes, which 
55 may be underspecified in a number of different ways. For example, they may or may not specify a part-of-speech, and 
they may specify certain syntactic or semantic features. For example, a pattern may have a wild-card node with the 
feature "ProperName" or "Location" marked, indicating that the pattern only applies when that node is matched to an 
input node that has the same feature. These wild-cards allow the system to hypothesize generalized mappings from 
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specific data. 

MATCHING THE TRANSFER IVIAPPINGS DURING RUN TIME 

[0073] In addition to the information pertaining to the mappings between the words or logical forms of the first lan- 
guage and the second language, additional information can also be stored or used during run time translation. The 
additional information can be used to choose an appropriate set of mappings and resolve conflicts as to which mappings 
to use, i.e. (referring to FiG. 2) when a source logical form 252 (or part thereof) generated for a source sentence 250 
matches more than one source side of the transfer mappings in the transfer mapping database 21 8. 
[0074] For example, when the source logicai form matches the source side of multiple transfer mappings in database 
21 8, a subset of these matching transfer mappings is iliustratively selected such that ail transfer mappings in the subset 
are compatible with one another (i.e., they are not conflicting) and based on a metric that is a function of how much of 
the input sentence the transfer mappings in the subset coilectively cover, as weil as other measures related to individual 
transfer mappings. Some such measures are set out in Table 1 . 

Table 1 



1. 
2. 
3. 
4. 
5. 



Size of transfer mapping matched. 

The frequency with which the transfer mapping was seen in the training data. 

The frequency with which the transfer mapping was generated from fuily aligned logical forms. 

The frequency with which the transfer mapping was generated from partially aligned logical forms. 

The frequency with which the transfer mapping was generated from logical forms that resulted from a fitted 

parse. 

An alignment score assigned to the transfer mapping by the alignment component. 



[0075] Once the subset of matching transfer mappings is selected, the transfer mappings in the subset are combined 
into a transfer logical form from which the output text is generated. 

[0076] It should be noted that the subset of matching transfer mappings can contain overlapping transfer mappings, 
so long as they are compatible. For example, the following logical form can be generated for the Spanish sentence 
"Haga die en el direccion de la oficina" which can be translated as "Click the office address": 
Hacer -- Dobj - click 

- en - direccion 

- de - oficina 

This logical form can potentially be matched to all of the transfer mappings 350, 352 and 354 because each transfer 
mapping contains this logical form. These transfer mappings overlap, but do not conflict (because all can be translated 
as the same thing). Therefore, all may be included in the subset of matching transfer mappings, and the transfer logical 
form can be generated from them. However, if it is desired to choose among them, the best choice may be transfer 
mapping 352 because it is the largest. Others could be chosen for a variety of other reasons as well. 
[0077] An example of conflicting, matching transfer mappings is shown as transfer mapping 353, which conflicts with 
transfer mapping 352. Therefore, for example, the logical form: 
Hacer -- Dobj - click 

- en - direccion 

would match all of transfer mappings 350, 352, 353 and 354. However, since transfer mappings 352 and 353 conflict 
(because they are translated differently) both cannot be part of the selected subset of matching transfer mappings. 
Thus, one is selected based on a predetermined metric. For example, subset 350, 352 and 354 can be compared 
against subset 350, 353 and 354 to see which covers the most nodes in the input logical form, collectively. Also, both 
transfer mappings 352 and 353 are the same size (on the source side) . Therefore, other information can be used to 
distinguish between them in selecting the subset of matching transfer mappings. 

[0078] As another example of conflicting transfer mappings, assume that a number of sentences processed during 
training included the phrase "click <something>" that aligned to the Spanish "hacer die en <something>". In other 
sentences, assume the sentence "click <something>" aligned to "elegir <something>" (literally "select something"). 
[0079] This yields the following mappings (note these examples are English mapped to Spanish whereas previous 
examples have been Spanish mapped to English): 
Click hacer 

Tobj -- * ^ Tobj -- die 
en --* 
for the first case, and 
click elegir 
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Tobj -- * Tobj -- * 

in the second case. 

[0080] In the proper contexts, translating "click" to "select" may be a legitinnate variation. However it does present a 
problenn in sonne cases. For example, notice that the source side of both transfers is identical, so at runtime, if the input 
5 logical form matches that source side, we are left with having to choose between the two different target sides, i.e. it 
must be decided whether to translate the input as "hacer die." or as "elegir.."? In the absence of further context (which 
would likely have manifested itself by causing differing source sides of the transfers) we choose between them based 
on various frequency and scoring metrics. 

[0081] Anothertype of conflict should also be mentioned. At runtime, for a given input sentence, there may be multiple 
10 matching transfer mappings that match different parts of the input sentence. Several of them can be chosen as the 
selected subset so that they can be stitched together to produce a transfer LF that covers the entire input. However 
some of these matches that are stitched together will overlap one another, and some will not. Of the ones that overlap, 
we can only use those that are "compatible" with one another. As discussed above, by "overlap" we mean two mappings 
where at least one node of the input sentence is matched by both mappings. By compatible, we mean the following: 
15 matches are always compatible if they do not overlap, and matches that do overlap are compatible if the target sides 
that correspond to the node(s) at which they overlap are identical. 

[0082] For example, if an input sentence is "cambiar configuracion de seguridad" (translated as "change the security 
setting") and it matches a transfer mapping as follows: 
cambiar change 
20 Tobj -- configuracion) Tobj - setting 

and we match another mapping of: 
configuracion setting 

mod - seguridad Mod security 

then the two matches do overlap (on "configuracion"), but they are compatible, because they also both translate "con- 
25 figuracion" to "setting". Therefore, we can combine them to produce a transfer LF (or target LF)of : 
change 

Tobj setting 

Mod security 

[0083] However suppose there was also a third mapping of: 

30 configuracion value 

Mod - seguridad Mod setting 

then this mapping which does overlap the previous two at "configuracion", is not compatible, because it would translate 
"configuracion" to "value", not "setting". Therefore, this mapping cannot be merged with the previous two, so either 
this transfer mapping, or the previous two, must be chosen, but not both at the same time. 

35 [0084] Table 1 shows examples of the information that can be used to further define the subset of matching transfer 
mappings (either to choose among conflicting matching transfer mappings or to narrow the subset of compatible, 
matching transfer mappings). Such information can include how much of the input sentence is covered by the subset 
of matching transfer mappings (collectively) and the size of the mappings, which can be ascertained from the logical 
form that is matched in the transfer mapping itself. The size of a logical form includes both the number of specified 

40 nodes as well as the number of linguistic relationships between the nodes. Thus, by way of example, the size of the 
logical form from the source side of mapping 350 equals 2, while the size of the logical form on the target side equals 
1 . In another example, the logical form on the source side of mapping 354 equals 4, while the target side of mapping 
354 equals 2. 

[0085] The information for choosing the subset of transfer mappings can also include other information related to 
45 individual transfer mappings, such as the frequency with which the logical forms in the transfer mapping are seen in 
the training data. If desired, the training data can include "trusted" training data, which can be considered more reliable 
than other training data. The frequency of the mapping as seen in the trusted training data can be retained in addition, 
or in the alternative, to storing the frequency of the mapping as seen in all of the training data. 
[0086] Other information that can be helpful in selecting the subset of matching transfer mappings when matching 
50 source logical forms to transfer mappings includes the extent of complete alignment of the logical forms in the training 
data from which the logical forms of a transfer mapping have been obtained. In other words, the alignment procedure 
can fully or completely align the nodes of the larger logical forms, orsome nodes can remain unaligned. In the example 
of Fig. 5B, all the nodes were aligned; however, as indicated above, this may not always be the case. Those mappings 
associated with fully aligned logical forms may be considered more reliable. Of course, information for resolving conflicts 
55 or further defining the subset can also indicate the frequency with which the mapping was generated from both fully 
aligned logical forms as well as partially aligned logical forms. 

[0087] Likewise, additional information can include the frequency with which the logical forms in the transfer mapping 
originated from a complete parse of the corresponding training data. In particular, the frequency with which the mapping 
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originated from a complete or fitted parse, or in contrast, tlie frequency that the mapping originated from only a partial 
parse can be stored for later use in resolving conflicts while matching during translation. 

[0088] Another form of information can include a score or value assigned to the transfer mapping by the alignment 
procedure used to extract the mapping. For instance, the score can be a function of how "strong" (linguistically mean- 
5 ingful) the aligned nodes are (or how confident the alignment component is in the transfer mapping). The score can 
therefore be a function of when (which iteration) and which rule formed the alignment. The particular function or metric 
used to calculate the alignment score is not crucial, and any such metric can be used to generate information related 
to an alignment score that can be used during run time translation. 

[0089] It should be noted that, although the present invention is described above primarily with respect to analyzing, 
10 aligning and using logical forms, at least some of the inventive concepts discussed herein are applicable to other 
dependency structures as well. 

[0090] Although the present invention has been described with reference to particular embodiments, workers skilled 
in the art will recognize that changes may be made in form and detail without departing from the spirit and scope of 
the invention. 

15 

Claims 

1 . A computer-implemented method of associating dependency structures from two different languages, wherein the 
20 dependency structures comprise nodes organized in a parent/child structure, the computer-implemented method 

comprising: 

associating nodes of the dependency structures to form tentative correspondences; and 
aligning nodes of the dependency structures as a function of at least one of eliminating at least one of the 
25 tentative correspondences and structural considerations. 

2. The computer-implemented method of claim 1 wherein associating includes forming tentative correspondences 
comprising direct translations. 

30 3. The computer-implemented method of claim 1 wherein associating includes forming tentative correspondences 
comprising translations of morphological bases and derivations. 

4. The computer-implemented method of claim 1 wherein associating includes forming tentative correspondences 
comprising bases and derived forms of translations. 

35 

5. The computer-implemented method of claim 1 wherein associating includes forming tentative correspondences 
between nodes wherein one of the nodes comprises more lexical elements than the other node. 

6. The computer-implemented method of claim 5 wherein said one of the nodes is a single word in one of the lan- 
40 guages and said other node comprises at least two words in the other language. 

7. The computer-implemented method of claim 1 wherein aligning pursuant to structural considerations comprises 
aligning nodes as a function of a set of rules. 

45 8. The computer-implemented method of claim 7 wherein each of the rules of the set of rules are applied to the 
dependency structures in a selected order. 

9. The computer-implemented method of claim 8 wherein each of the dependency structures comprise a set of un- 
aligned nodes and wherein each of the rules are applied successively to the set of unaligned nodes until a set of 

50 aligned nodes is identified, then the nodes of the set of aligned nodes are removed from the set of unaligned nodes 

and each of the rules of the set of rules is again applied successively to the set of unaligned nodes. 

10. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a set of 
nodes if a bidirectionally unique translation exists. 

55 

11. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
parent nodes, one from each dependency structure having a tentative correspondence to each other, if each child 
node of each respective parent node is already aligned to a child of the other parent node. 
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12. The computer-implennented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
child nodes, one from each dependency structure, if a tentative correspondence exists between them and if a 
parent node of each respective child node is already aligned to a corresponding parent node of the other child. 

5 13. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
nodes, one from each dependency structure, if respective parent nodes of the nodes under consideration are 
aligned with each other and respective child nodes are also aligned with each other. 

14. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a first verb 
10 node and an associated child node that is not a verb node from one dependency structure to a second verb node 

of the other dependency structure if the associated child node is already aligned with the second verb node, and 
either the second verb node has no aligned parent nodes, or the first verb node and the second verb node have 
associated child nodes aligned with each other. 

15 15. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
nodes, one from each dependency structure, comprising the same part-of -speech, if there are no unaligned sibling 
nodes, and respective parent nodes are aligned, and linguistic relationships between the set of nodes under con- 
sideration and their respective parent nodes are the same. 

20 16. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
nodes, one from each dependency structure, comprising the same part-of-speech, if respective child nodes are 
aligned with each other and the linguistic relationship between the set of nodes under consideration and their 
respective child nodes are the same. 

25 17. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises if an unaligned node 
of one of the dependency structures having immediate neighbor nodes comprising respective parent nodes, if any, 
and respective child nodes, if any, all aligned, and if exactly one of the immediate neighbor nodes is a non-com- 
pound word aligned to a node comprising a compound word, then align the unaligned node with the node comprising 
the compound word. 

30 

18. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
nodes, one from each dependency structure, comprising pronouns if respective parents are aligned with each 
other and neither of the nodes have unaligned siblings. 

35 19. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a pair of 
nodes, one from each dependency structure, comprising nouns if respective parents comprise nouns are aligned 
with each other and neither of the nodes have unaligned siblings, and where a linguistic relationship between each 
of the nodes and the respective parents comprise either a modifier relationship or a prepositional relationship. 

40 20. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a first verb 
node of one dependency structure to a second verb node of the other dependency structure if the first verb node 
has no tentative correspondences and a single associated child verb node that is already aligned to the second 
verb node. 



45 21 . The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a first verb 
node and a single, respective parent node of one dependency structure to a second verb node of the other de- 
pendency structure if the first verb node has no tentative correspondences and the single parent verb node is 
aligned with the second verb node, where the single parent verb node has no unaligned verb child nodes besides 
the first verb node, and the second verb node has no unaligned verb child nodes. 

50 

22. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a first node 
comprising a pronoun of one dependency structure to a second node of the other dependency structure, if a parent 
node of the first node is aligned with the second node and the second node has no unaligned child nodes. 



55 23. The computer-implemented method of claim 8 wherein one rule of the set of rules comprises aligning a first verb 
node and a respective parent node of one dependency structure to a second verb node of the other dependency 
structure if the first verb node has no tentative correspondences and the parent verb node is aligned with the 
second verb node and where the relationship between the first verb node and the parent verb node comprises a 
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modal relationship. 

24. A computer-implemented method of associating dependency structures from two different languages, wherein the 
dependency structures comprise nodes organized in a parent/child structure, the computer-implemented method 
comprising: 

aligning nodes of the dependency structures as a function of a set of rules, the rules being applied to the nodes 
initially irrespective of the parent/child structure. 

25. A computer readable media having information thereon for a computer-implemented machine translation system 
to translate text from a first language to a second language, the information comprising: 

a piurality of mappings, each mapping indicative of associating a dependency structure of the first language 
with a dependency structure of the second language, wherein at least some of the mappings correspond to 
dependency structures of the first language having varying context with some common elements, and asso- 
ciated dependency structures of the second language to the dependency structures of the first language also 
having varying context with some common elements. 

26. The computer readable media of claim 25 wherein the dependency structures of said at least some of the mappings 
have two common elements in each of the languages. 

27. The computer readable media of claim 25 wherein the dependency structures of said at least some of the mappings 
have three common elements in each of the languages. 

28. The computer readable media of claim 25 wherein the information includes information indicative of a size of each 
dependency structure of the first language. 

29. The computer readable media of claim 25 wherein the information includes information indicative of an extent of 
a complete alignment of the dependency structures of the first language originating from a larger dependency 
structure. 

30. The computer readable media of claim 25 wherein the information includes information indicative of a frequency 
the dependency structure occurred in training data. 
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